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INTEGRATED METHOD FOR CHAOTIC 
TIME SERIES ANALYSIS 

The United States Government has rights in this invention pursuant to contract no. 
DE-AC05-96OR22464 between the United States Department of Energy and Lockheed 
5 Martin Energy Research Corporation. 

FIELD OF THE INVENTION 
The present invention relates to methods for analyzing nonlinear data from diverse 
sources, including but not limited to electroencephalogram and magnetoencephalogram 
brain wave data, electrocardiogram data, and motor current data from processes such as 
10 metal machining operations and electrically-driven pumps. 

CROSS-REFERENCE TO RELATED APPLICATIONS 
The following U.S. Patent Applications describe related inventions, Epileptic 
Seizure Detection by Nonlinear Methods, Serial No. 08/619,024, filed March 20, 1996; 
Epileptic Seizure Prediction by Nonlinear Methods, Serial No. 08/619,030, filed March 
15 20, 1996, and Method and Apparatus for Extraction of Low-Frequency Artifacts from 
Brain Waves for Alertness Detection, Serial No. 08/619,031, filed March 20, 1996. The 
entire disclosure of each of the above identified patent applications are hereby 
incorporated by reference. 

BACKGROUND OF THE INVENTION 
20 A deviation from normal process conditions can indicate performance degradation 

or the onset of imminent failure. Real- or near-real-time monitoring of the process 
condition can detect this deviation in data from current, voltage, acceleration, acoustic 
emission, pressure, power, and other measurable quantities. Prompt response to a 
significant deviation can help to maintain optimal performance and can avert disastrous 
25 failure. 

The theory of nonlinear dynamics provides a basis for understanding and 
potentially controlling many complex physical and engineering systems. For example, 
an extensive literature exists for nonlinear dynamics in the brain and related work as 
cited in U.S. Patent Application Serial No. 08/619,024, U.S. Patent Application Serial 
30 No. 08/619,030, and U.S. Patent Application Serial No. 08/619,031 hereinabove 
incorporated by reference. Other documents listed below refer to earlier work using 
linear and nonlinear methods of analysis. Numbers in parentheses within the text of the 



SUBSTITUTE SHEET (RULE 26) 



WO 98/49935 PCT/US98/09418 

2 

disclosure hereinafter refer to documents as listed below. 
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3. R. C. Watt and S. R. Hameroff, "Phase Space Analysis of Human EEG during 
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10. F. Takens, "Detecting Strange Attractors in Turbulence," in Lecture Notes in 
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11. D. S. Broomhead and G.P. King, "Extracting Qualitative Dynamics from 
Experimental Data," Physica D 20, 217-236 (1986). 

OBJECTS OF THE INVENTION 

It is an object of the present invention to provide new and improved methods and 
apparatus for discriminating between similar but different states in a nonlinear process. 

It is a further object of the present invention to provide new and improved 
methods for nonlinear analysis of experimental time serial data. 

The method obtains a phase-space (PS) representation of the process, which in 
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turn is converted into a probability density function (PDF). Condition change is 
monitored by comparing two PS-PDFs via a x 2 statistical measure. One example 
application involves monitoring of brain waves to distinguish various states in an 
epileptic patient A second example distinguishes different drilling conditions from 
spindle motor current data. A third example distinguishes balanced and unbalanced 
pumping conditions from electrical power data. 

Further and other objects of the present invention will become apparent from the 
description contained herein. 

SUMMARY OF THE INVENTION 

In accordance with one aspect of the present invention, the foregoing and other 
objects are achieved by a method for automatically discriminating between similar but 
different states in a nonlinear process which comprises the steps of: operating a data 
provision means selected from the group consisting of data storage means and data 
acquisition means to provide at least one channel of nonlinear data, called e-data; 
separating the e-data into artifact data, called f-data, and artifact-free data, called g-data, 
while preventing phase distortions in the data; processing g-data through a filter to 
produce a filtered version of g-data, called h-data; applying at least one nonlinear 
measure to at least one type of data selected from the group consisting of e-data, f-data, 
g-data, and h-data to provide at least one time serial sequence of nonlinear measures from 
which at least one indicative trend selected from the group consisting of abrupt increases 
and abrupt decreases can be determined; comparing at least one indicative trend with at 
least one known discriminating indicator; determining from the comparison whether 
differences between similar but different states are indicated; and providing notification 
whether differences between similar but different states are indicated, the separating step, 
the processing step, the applying step, the comparing step, and the determining step being 
accomplishable in one integrated sequence of computer analyses. 

In accordance with a second aspect of the present invention, the foregoing and 
other objects are achieved by an apparatus for automatically discriminating between 
similar but different states in a nonlinear process which comprises: data provision means 
for providing at least one channel of nonlinear data, called e-data, the data provision 
means being selected from the group consisting of data storage means and data 
acquisition means; separation means for separating the e-data into artifact data, called 
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f-data, and artifact-free data, called g-data, while preventing phase distortions in the data, 
communicably connected to the data provision means; filter means for filtering g-data 
to produce a filtered version of g-data, called h-data, communicably connected to the 
separation means; application means for applying at least one nonlinear measure to at 
least one type of data selected from the group consisting of e-data, f-data, g-data, and h- 
data to provide at least one time serial sequence of nonlinear measures, from which at 
least one indicative trend selected from the group consisting of abrupt increases and 
abrupt decreases can be determined, communicably connected to the filter means; 
comparison means for comparing at least one indicative trend with at least one known 
discriminating indicator, communicably connected to the application means; 
determination means for determining from the comparison whether differences between 
similar but different states are indicated, communicably connected to the comparison 
means; and notification means for providing notification whether differences between 
similar but different states are indicated, communicably connected to the determination 
means, the separation means, the low pass filter means, the application means, the 
comparison means, and the determination means being operable in one integrated 
sequence of computer analyses. 

In accordance with a third aspect of the present invention, the foregoing and other 
objects are achieved by a method for automatically discriminating between similar but 
different states in a nonlinear process which comprises the steps of: operating a data 
provision means selected from the group consisting of data storage means and data 
acquisition means to provide at least one channel of nonlinear data, called e-data; 
separating the e-data into artifact data, called f-data, and artifact-free data, called g-data, 
while preventing phase distortions in the data; processing g-data through a filter to 
produce a filtered version of g-data, called h-data; applying the lag derived from the first 
minimum in the mutual information function to create a d-dimensional probability density 
function which forms a high-dimensional topology for at least one type of data selected 
from the group consisting of e-data, f-data, g-data, and h-data whereby at least one 
indicative trend can be determined; comparing at least one indicative trend with at least 
one known discriminating indicator; determining from the comparison whether 
differences between similar but different states are indicated; and providing notification 
whether differences between similar but different states are indicated, the separating -step, 
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the processing step, the applying step, the comparing step, and the determining step being 
accomplishable in one integrated sequence of computer analyses. 

In accordance with a fourth aspect of the present invention, the foregoing and 
other objects are achieved by a method for automatically discriminating between similar 
5 but different states in a nonlinear process which comprises the steps of: operating a data 
provision means selected from the group consisting of data storage means and data 
acquisition means to provide at least one channel of nonlinear data, called e-data; 
separating the e-data into artifact data, called f-data, and artifact-free data, called g-data, 
while preventing phase distortions in the data; processing g-data through a filter to 

10 produce a filtered version of g-data, called h-data; applying the lag derived from the first 
minimum in the mutual information function to create a d-dimensional probability density 
function which forms a high-dimensional topology for at least one type of data selected 
from the group consisting of e-data, f-data, g-data, and h-data whereby at least one 
indicative trend can be determined; generating a key from the set of indices of each bin 

15 in the multidimensional probability density function; representing occupied bins from 
the same class by a linked list to provide a reduction in the number of elements to be 
stored in the manner known as hashing whereby at least one hashed discriminating trend 
can be determined; comparing at least one indicative trend with at least one known 
discriminating indicator; determining from the comparison whether differences between 

20 similar but different states are indicated; and providing notification whether differences 
between similar but different states are indicated, the discriminating indicator being 
hashed, and the separating step, the processing step, the applying step, the generating 
step, the representing step, the comparing step, and the determining step being 
accomplishable in one integrated sequence of computer analyses. 

25 In accordance with a fifth aspect of the present invention, the foregoing and other 

objects are achieved by an apparatus for automatically discriminating between similar but 
different states in a nonlinear process which comprises: data provision means for 
providing at least one channel of nonlinear data, called e-data, said data provision means 
being selected from the group consisting of data storage means and data acquisition 

30 means; separation means for separating the e-data into artifact data, called f-data, and 
artifact-free data, called g-data, while preventing phase distortions in the data, 
communicably connected to the data provision means; filter means for filtering g-data 
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to produce a filtered version of g-data, called h-data, communicably connected to the 
separation means; application means for applying the lag derived from the first minimum 
in the mutual information function to create a d-dimensional probability density function 
which forms a high-dimensional topology for at least one type of data selected from the 
group consisting of e-data, f-data, g-data, and h-data, whereby at least one indicative 
trend can be determined, communicably connected to the filter means; comparison 
means for comparing at least one indicative trend with at least one known discriminating 
indicator, communicably connected to the application means; determination means for 
determining from the comparison whether differences between similar but different states 
are indicated, communicably connected to the comparison means; and notification means 
for providing notification whether differences between similar but different states are 
indicated, communicably connected to the determination means, the discriminating 
indicator being hashed and the separation means, the filter means, the application means, 
the comparison means, and the determination means being operable in one integrated 
sequence of computer analyses. 

In accordance with a sixth aspect of the present invention, the foregoing and other 
objects are achieved by an apparatus for automatically discriminating between similar but 
different states in a nonlinear process which comprises: data provision means for 
providing at least one channel of nonlinear data, called e-data, the data provision means 
being selected from the group consisting of data storage means and data acquisition 
means; separation means for separating the e-data into artifact data, called f-data, and 
artifact-free data, called g-data, while preventing phase distortions in the data, 
communicably connected to the data provision means; filter means for filtering g-data 
to produce a filtered version of g-data, called h-data, communicably connected to the 
separation means; application means for applying the lag derived from the first minimum 
in the mutual information function to create a d-dimensional probability density function 
which forms a high-dimensional topology for at least one type of data selected from the 
group consisting of e-data, f-data, g-data, and h-data whereby at least one indicative trend 
can be determined, communicably connected to the filter means; generation means for 
generating a key from the set of indices of each bin in the multidimensional probability 
density function, communicably connected to the application means; representation 
means for representing occupied bins from the same class by a linked list to provide a 



SUBSTITUTE SHEET (RULE 26) 



WO 98/49935 PCT/US98/09418 

7 

reduction in the number of elements to be stored in the manner known as hashing, 
whereby at least one hashed indicative trend can be determined, communicably connected 
to the generation means; comparison means for comparing at least one indicative trend 
with at least one known discriminating indicator, communicably connected to the 
representation means; determination means for determining from the comparison whether 
differences between similar but different states are indicated, communicably connected 
to the comparison means; and notification means for providing notification whether 
differences between similar but different states are indicated, communicably connected 
to the determination means, the discriminating indicator being hashed and the separation 
means, the filter means, the application means, the generation means, the representation 
means, the comparison means, and the determination means being operable in one 
integrated sequence of computer analyses. 

BRIEF DESCRIPTION OF THE DRAWINGS 
In the drawings: 

Figures la, lb, and lc are X s comparisons of base-case and cutset dynamics for 
an EEG dataset via the (a)two-, (b) three-, and (c) four-dimensional phase-space 
probability density function method. 

Figure 2 is a two-dimensional condition comparison versus X 2 ) for drilling 
data, with the following symbol meanings: solid dots (•) for reground new design 
drilling, open circles (o) for worn new design drilling, "x" for damaged new design 
drilling, and plus signs (+) for worn old design drilling. 

Figure 3 is a three-dimensional condition comparison (e 3 versus X 3 ) for drilling 
data, with the following symbol meanings: solid dots (•) for reground new design 
drilling, open circles (o) for worn new design drilling, "x" for damaged new design 
drilling, and plus signs (+) for worn old design drilling. 

Figure 4 is a four-dimensional condition comparison (e 4 versus XJ for drilling 
data, with the following symbol meanings: solid dots (•) for reground new design 
drilling, open circles (o) for worn new design drilling, "x" for damaged new design 
drilling, and plus signs (+) for worn old design drilling. 

Figure 5 is a two-dimensional condition comparison (X 2 versus time) of 
mechanically balanced and unbalanced pump conditions from motor power data, showing 
more than twenty standard deviations difference from the mean of a gaussian probability 
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distribution. This result is a clear indication of significant difference in the dynamics of 
the two pump states. 

Figure 6 is a flow chart which describes the steps and indicates the sequence of 
steps in one embodiment of the method of the present invention. 

For a better understanding of the present invention, together with other and further 
objects, advantages and capabilities thereof, reference is made to the following disclosure 
and appended claims in connection with the above-described drawings. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention is distinguished from the inventions described in U.S. Patent 
Application Serial No. 08/619,024, U.S. Patent Application Serial No. 08/619,030, and 
U.S. Patent Application Serial No. 08/619,031 which have been incorporated by reference 
above. The present invention provides both integrated methods and integrated apparatus 
for analyzing chaotic time series data which can be operated independent of human 
intervention in one integrated sequence of computer analyses to discriminate or 
distinguish states automatically. The process can function without human selection of 
data types or human selection of process steps. The previous methods and apparatus, 
described in U.S. Patent Application Serial No. 08/619,024, U.S. Patent Application 
Serial No. 08/619,030, and U.S. Patent Application Serial No. 08/619,031 depend upon 
a group of research-type non-linear mathematical analytical tools with human intervention 
to interpret or select data or process steps before subsequent steps can be taken. 

Although in the specific examples cited herein g-data is processed through a low- 
pass filter to separate a low-pass-filtered version of g-data called h-data, from high 
frequency data, it must also be noted that g-data can be processed through a high-pass 
filter to separate the low frequency data from high frequency filtered data in an alternate 
embodiment of the invention. 

The present invention also adds to the previous nonlinear methods a new set of 
method steps that is based on the phase-space representation, which Watt and Hameroff 
(3-4) showed to be a useful tool for brain dynamics. In addition, a novel hashing 
technique is incorporated to improve the phase-space probability distribution function 
(PDF), and to reduce the memory requirement for computers performing this type of 
. analysis. For example, without the hashing technique, the memory requirement for 5- 
dimensional PDFs is 5x50 5 = 625 million elements in the array, which is only available 
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in the world's largest, high-performance computers. The memory requirement is 2x50 6 
= 31 billion elements for 6-dimensional PDFs. The hashing method reduces the memory 
storage requirement for the 5-dimensional case by a factor of 380, allowing this case to 
run easily on modest scientific workstation-class computers. Moreover, the hashing 
5 technique reduces the execution time for the 4-dimensional PDF by a factor of 31, and 
reduces the array storage requirement by a factor of 9. 

PHASE-SPACE CONSTRUCTION 
Condition monitoring begins with experimental data that capture the process 
dynamics. This data must be acquired or retrieved from data storage before chaotic time 

10 series analysis can be applied. Data storage means may include but not be limited to 
magnetic media such as magnetic tape, computer disks or diskettes, and optical media 
such as CD ROM disks. Data Acquisition means may include but not be limited to 
electrocenpalogram, magnetoencephalogram, electrocardiogram, accelerometer, ammeter, 
voltmeter, ohm-meter, audiometer, light meter, pressure meter, thermometer, and the like. 

15 The implementation of this method typically requires at least 10,000 data points 

with at least 5 active bits of digitization for a robust statistical representation. The time- 
serial data (xj occur at equally spaced times ft). 

From the experimental data, a nonlinear statistical parameter, called the mutual 
information function (MIF) is extracted, which is a nonlinear analog of the auto- 

20 correlation function. MIF was formulated by Shannon and Weaver (1) and was applied 
to time series by Fraser and Swinney (2). MIF measures the average information (in bits) 
that can be inferred from one measurement about a second measurement, and is a 
function of the time delay between measurements. The first minimum in the MIF 
indicates the shortest time lag (X = tj - tj) when the signal, x(tj), is decorrelated from 

25 x(tj). A minimum is defined as two successive decreases in x(t), followed by two 
successive increases in x(t). To capture the process dynamics, the adequacy of the 
sampling rate is measured as A^IO timesteps and the average number of timesteps per 
cycle as £5. 

Next, the time-serial data is converted into a geometric (or phase-space) 
30 representation (3-4). A two-dimensional version of this phase-space plot involves 
construction of lines between the sequence of points (x,, x i+ J. The value for the index 
(i) runs through the dataset, and the lag (L = X) comes from the first minimum in the 
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mutual information function, as described in the previous paragraph. The resulting 
geometric figure is very complex, showing topological regions preferred and avoided by 
the dynamics. A three-dimensional diagram involves construction of lines between the 
sequence of points (Xj, x^ L , x^l). The lag ( L ) also comes from the first minimum in 
the MIF (i.e., L = X/2). The general case involves a d-dimensional point of the form, y(i) 
= [Xi, x* L , ... , x^jl ]. The lag is L = INT[0.5 + %/(d-l)], where INT(...) denotes 
conversion of a decimal number to the next smaller integer (e.g., 3.14 becomes 3). The 
highest dimensionality (d) for this representation occurs when INT[0.5 + X/(d-\)] = 1. 
The general geometric representation requires that the sequence of points be connected 
to form a d-dimensional topology, then subsequently compared to a companion phase- 
space diagram to determine a condition change. A quantitative visual comparison of 
phase-space representations is very difficult in two dimensions, and is essentially 
impossible in three or higher dimensions. 

CONSTRUCTION OF THE PHASE-SPACE PROBABILITY DENSITY 

FUNCTION 

Visualization of the three- (and higher-) dimensional phase-space is difficult. 
Consequently, the d-dimensional vector, y(i), is discretized into N equally spaced bins 
in each coordinate direction. Specifically, each component of y(i) is converted into an 
integer , J = INT[N(Xj - xj/fo - xj] + 1, where INT is the integer function (defined 
above) and 1 < J < N. The symbols, x, and x„, denote the maximum and minimum in 
the Xj values over the whole dataset, respectively. The number of bins (N) must be 
chosen to be consistent with the data precision (e.g., for 12-bits of precision, N must be 
less than 4096). A value of N=50 adequately represents the dynamics for the example 
cases. 

From this discretized version of the phase space, a probability density function 
(PDF) is constructed by incrementing the population of the appropriate d-dimensional bin 
by one, corresponding to the discretized coordinates of each y(i). This PDF does not 
have information about the connectivity between y(i) and y(i+l), which the phase space 
diagram does display. However, as noted above, the phase space diagram is complex and 
difficult to visualize in three or more dimensions. Moreover, high (low) PDF densities 
indicate domains that the process visits frequently (seldom). Thus, the topology of the 
PDF is equivalent to a phase-space diagram. 
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The subsequent discussion uses "PDFs" to indicate use of the above methodology 
for constructing two-, three-, and four-dimensional probability density functions. First, 
PDFs are constructed for a base case, with the notation: P 2 = two-dimensional PDF, P 3 
= three-dimensional PDF, P 4 = four-dimensional PDF. PDFs are obtained for an 
5 unknown condition from the same process, with the notation: Q 2 = two-dimensional PDF, 
Q 3 = three-dimensional PDF, Q 4 = four-dimensional PDF. 

COMPARISON OF TWO PHASE-SPACE PROBABILITY DENSITY 

FUNCTIONS 

Next, PDFs are compared for the unknown process state (Q t ) to the corresponding 
10 base-case PDFs (Pi). The occurrence frequency in each bin of Pj is scaled so that the 
total population (sum over all the bins) is the same as that in the total population of the 
corresponding Q r Thus, the population in each bin of Pj is scaled to cP, where the scale 
factor is c = (total population of Q,)/(total population in Pj). The base-case and 
unknown-case PDFs is compared by a x 2 statistic: 

15 

X 2 2 = Z [Q^ij) - P 2 (ij)] 2 / P 2 (ij) + (Q* - PJ 2 / Pfc (Eq. 1) 

20 

X, 2 = Z [Q 3 (ij,k) - P 3 (ij,k)] 2 / P 3 (ij,k) + (Q 3s - P 3$ ) 2 / P 3f (Eq. 2) 

25 

X* 2 - Z [Q 4 (ij,k,l) - P 4 (ij,k,l)] 2 / P 4 (ij,kJ) + (Q 4s - P 45 ) 2 /P 4s (Eq. 3) 

30 The first term in each equation is used if the denominator is >10. A bin 

population of <10 causes significant inaccuracy in x* 2 (5), in which case, we pool such 
sparse populations into one (or more) accumulation bin(s) for the base case (P u ) and for 
the cutset (QJ. We subsequently determine the contribution of these sparse populations 
to x 2 via the second term in each equation. We further note that a normal distribution 

35 approximates Xi 2 when the number of bins is large (>30), for which the normalization is 
Xj = (Xj 2 -o)/(2o)*S with u = number of bins contributing to the sum, minus one (6). 

A second measure is also used for comparative purposes based on the entropy of 
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the two-, three-, and four-dimensional PDFs: 



N 



= -(2 LnN)' 



r 1 2 P 2 (ij) ln[P 2 (ij)] 



(Eq. 4) 



N 



e 3 = -(2 InN) 



r' X P 3 (ij,k) ln[P J (ij,k)] 



(Eq. 5) 



N 



e 4 = -(2 InN)" 



r' 2 P 4 (ij,k,l) in[P 4 (ij,k,l)] 



(Eq. 6) 



The first factor in Equations 4-6 normalizes the entropy (e^ to have a range 
between zero and unity. PDF bins with zero population make no contribution to the sum 
because z ln(z) -> 0 in the limit of z -» 0. 

EXAMPLE I - RESULTS FOR BRAIN WAVE DATA 
This example began with sixteen channels of human bipolar-montage 
electroencephalogram (EEG) data in analog form on VHS tapes. 23-45 minutes of data 
were converted to digital form with 12-bit precision at a sampling rate of 512 hertz. 
Four epochs of data were examined: non-seizure, transition from non-seizure to seizure 
(transition), epileptic seizure, and post-seizure. Channel 13 which was closest to the 
patients' right eye, was chosen for analysis as a demonstration of the zero-phase, 
quadratic filtering method for removing low-frequency EEG artifacts due to eye blinks 
and muscle twitches. 

The above methodology was applied to obtain PDFs for brain activity. The value 
of the first minimum (X) in the mutual information function typically varies between 10- 
50 timesteps during the non-seizure period, so X is fixed at the value for the base case 
(the first 200 seconds of data) for all subsequent analysis. Figure 1 shows a f 
comparison of the base case to cutsets (20-second data segments, having a 10-second 
overlap with the previous cutset) for two-, three-, and four-dimensional PDFs. The plots 
show Xj (solid curve) for each cutset, along with a running average of X; (dashed curve) 
over eleven consecutive cutsets, plotted at the central (sixth) point of the averaging 
window. Xj is maximum during the clinical seizure, which occurs from 966 - 1035 
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seconds (between the two vertical lines in each subplot on either side of 1000 seconds). 
The horizontal line across each subplot lies at a gaussian probability of P(XJ = 0.999999, 
corresponding to a probability of 10 -6 that the PDFs are not different (at X t = 4.75). 
Most Xi values lie above 4.75, indicating high variability in brain dynamics, even in the 

5 non-seizure state. X; begins a monotonic rise to the seizure peak, beginning at 930 
seconds, consistent with the pre-seizure activity in the phase-space PDFs (not shown). 
Large excursions occur in X ( beginning at 550 seconds (Xj > 47.5). Xj remains above 
4.75 after 520 seconds for X 2 , and after 460 s for X 3 and X 4 . Beginning at 500 seconds, 
the average values of X 3 and X4 remain above the non-seizure average. These latter 

10 features are interpreted as seizure predictors. Thus, X 2 is sufficient to detect a seizure, 
while higher dimensionality PS-PDFs (X 3 and X<) are more sensitive as seizure 
predictors. This example is one of ten EEG datasets that have been analyzed with similar 
success. 

EXAMPLE n - RESULTS FOR DRILLING DATA 

15 This example began with one channel (15 seconds) of current data from one phase 

of a three-phase spindle motor during drilling of cast iron. The spindle speed was 29.2 
revolutions per second with a feed rate of 0.3556 millimeters per revolution. The data 
were low-pass filtered with a cutoff frequency of 20 kilohertz, and sampled at 60 
kilohertz. Twenty datasets were acquired for five successive cuts by 16.669 millimeter 

20 diameter carbide-tipped, titanium nitride coated, drill bits in four different conditions: 
reground new design, worn new design, damaged new design, and worn old design. 
Low-frequency variations at 220 hertz were removed with a zero-phase, quadratic 
filtering method, leaving a chaotic signal for subsequent analysis. 

The first cut by the reground new design drill bit was used as the base case. 

25 Application of the PS-PDF methodology to these datasets yielded a sequence of points, 
(Xj, ej, as shown in Figures 2-4. The sequence of cuts for each drill bit condition have 
the following symbol designations: solid dots (•) for reground new design drilling, open 
circles (o) for worn new design drilling, V for damaged new design drilling, and plus 
signs (+) for worn old design drilling. Lines connect the points for each bit condition, 

30 with arrows indicating the progression of drilling cuts during the experiment Figure 2 
shows a clear pattern through each cutting sequence in the (X 2 , ej) parameter space. The 
data for the reground new design forms a distinct cluster in the lower left corner of 
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Figure 2, but point clusters for the other bit conditions are intermingled. Figure 3 
displays the results for (X 3 , e 3 ), distinguishing among the four bit conditions, except for 
overlap between worn new design and damaged new design drilling. Figure 4 for (X,, 
e 4 ) is very similar to Figure 3, showing little discrimination improvement by using a four- 
dimensional PS-PDF. Thus, Figure 3 indicates that three-dimensional PS-PDF measures 
are sufficient to distinguish dynamics for successive drilling cuts, and to monitor different 

drill bit conditions. 

EXAMPLE m - RESULTS FOR PUMP POWER DATA 

This example began with one channel of total electrical power to a three-phase, 
two-pole motor, directly driving a five-vane centrifugal pump at 58.5 revolutions per 
second. The power data typically varied between 3,500 - 5,800 watts, spanning 14 
seconds at a sampling rate of 5 kilohertz (a total of 70,000 data points). The two datasets 
were for mechanically balanced and unbalanced conditions. 

Both datasets were divided into seven 10,000-point non-overlapping cutsets for 
seven separate condition comparisons. The above methodology was used with the i-th 
cutset for the balanced state as the base case, and the i-th cutset for the unbalanced state 
as the corresponding unknown. The phase-space topology forms a double loop for all 
of the cutsets, with slightly different gaps for the two conditions. Figure 5 shows that 
X 2 is >20 throughout the comparison. This result indicates a very significant difference 
between the datasets, equivalent to more than twenty standard deviations from the mean 
of a gaussian distribution. 

HASHING 

In addition, hashing (a well-known computer science technique for searching) 
improves the phase-space PDF method. A good reference for the hashing technique is 
The Art of Programming. Vol 3: Sorting and Searching, by Donald E. Knuth, (published 
by Addison- Wesley in 1973). Each bin in a multi-dimensional PDF has a set of indices. 
A key is generated from the set of indices (known as hashing), allowing the occupied 
bins to be classified. Occupied bins with different sets of indices may have the same 
key, in which case they belong to the same class known as collisions. The occupied bins 
from the same class are represented by a linked list, which is well-known to the skilled 
artisan as a technique for storing data in computers. By careful choice of hashing, the 
number of occupied bins with the identical keys is small. This approach provides an 
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efficient way to search for the existence of a specific bin. The typical number of data 
points n (10,000 - 200,000) gives an upper bound on the number of occupied bins, and 
in turn, allows one to estimate the amount of space required by the hashing technique. 
Specifically, the hashing technique requires at most [n (d+3)] elements, where d is the 

5 dimensionality of the PDF. The conventional storage scheme for a d-dimensional array 
requires N d elements, which translates to over 6 million elements for N=50, d=4, and 
n= 102,400, using the conventional storage scheme for a single 4-dimensional array. On 
the other hand, the hashing technique requires less than 0.75 million elements, a 
reduction by a factor of almost 9. For the 5-dimensional PDF, the hashing technique 

10 requires only about 0.82 million elements, corresponding to a reduction factor of more 
than 380 over the conventional storage scheme. The memory reduction factor for a 6- 
dimensional array is 16,954. 

The use of hashing not only reduces the data storage requirements, but also 
improves the computation speed. The hashing technique decreased execution time by a 

15 factor of 31 in the 4-dimensional PDF (including construction of and operations on the 
PDF) from 1,791 seconds to 58 seconds on a Sun Sparc workstation. Inclusion of the 
5-dimensional PDF increased the execution time to 67 seconds, which is very modest. 
Thus the hashing approach makes higher dimensional PDF calculations both feasible and 
practical. No prior art is known that uses hashing to reduce the array storage 

20 requirements for large multi-dimensional arrays in PDF calculations. Among the 
advantages of this invention are: novel application of hashing to decrease computer 
memory requirements for large, sparse, multi-dimensional arrays; and faster execution 
speed using the hashing technique for array storage. 

These features enable higher-dimensional PDF calculations on modest computers, 

25 that would otherwise require array storage that is available only on the world's largest, 
high-performance computers. An increase in dimensionality of the PDF arrays (e.g., from 
5- to 6- dimensional) raises the array storage requirements to values that exceed even the 
memory on world's largest, high-performance computers. Use of hashing in these cases 
allows the PDF calculations on modest workstation class computers to be performed. 

30 Moreover, this hashing approach is generally applicable to storage of sparse, high- 
dimensional arrays, which is an increasing important problem for multi-dimensional 
computational simulations (e.g., three dimensions in coordinate space and -three 
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dimensions in velocity space). 

THE STEPS OF A PREFERRED EMBODIMENT OF THE INVENTION 

The automated steps in the integrated methodology are summarized as follows: 

(Step 1) Obtain input data for subsequent analysis, including which of the below 

analyses to (or not to) perform; 
(Step 2) Select an appropriate sequence of time serial data values (xj, taken at 

equal time intervals (t;); 
(Step 3) Compute linear statistical measures for the data as a quality check, and 

terminate the analysis if the data is of inadequate quality, according to the 

below warnings; 

(Step 4) Apply the zero-phase, quadratic filter to the data, thus decomposing the 
signal into a low-frequency (artifact) and a separate artifact-free signal for 
subsequent analysis; 

(Step 5) Apply the standard fourth-order, low-pass filter to the data from (Step 4); 
(Step 6) For three channels of orthogonal acceleration data (a), integrate a with 
respect to time to obtain velocity (v), from which the vector dot product 
of velocity and acceleration yields power (p = P/m = a • v for a 
component mass, m) for subsequent analysis. It must be noted that 
experimental processes show a systematic decrease in the oscillatory signal 
for p, indicative of dissipation in the process. This systematic change is 
removed via application of the zero-phase, quadratic artifact filter; 
(Step 7) Integrate the power (p) from (Step 6) with respect to time to obtain the net 
energy (E) for each complete power cycle, as defined by either (a) two 
successive zero crossings in p going from positive-to-negative power 
(negative-starting energy, E^), or (b) two successive zero crossings in p 
going from negative-to-positive power (positive-starting energy, E* ); 
(Step 8) Use data from (Step 7) to compute the period-n return map for E{ and E*; 
(Step 9) Select a portion of the dataset as a base case from the results from (Step 
4),(Step 5), or (Step 7), as appropriate, obtain linear statistics, compute the 
first minimum (M,) in the mutual information function (MIF), from which 
the base case phase-space probability density function is then calculated; 
(Step 10) Select sequential subsets from the original data (cutsets of 10,000 or-more 
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data points); 

(Step 11) Obtain linear statistical measures for this cutset, including the minimum 
(xj, maximum (xj, average (x), standard deviation (a), absolute average 
deviation (a), skewness (s), kurtosis (k), and average number of timesteps 
5 per cycle (TJ; 

(Step 12) Compute the univariate mutual information function for the cutset, and 

. find the location of the first minimum (M,) in the mutual information; 
(Step 13) Compute the Fourier power spectral density for the cutset, and determine 
the frequency of the first (f { ) and second (f 2 ) largest peaks in the spectrum; 
10 (Step 14) Obtain the correlation dimension spectrum for the cutset; 
(Step 15) Calculate the Kolmogorov entropy spectrum for the cutset; 
(Step 16) Perform principal components analysis on the cutset data; 
(Step 17) Compute the phase-space PDF for the cutset, and obtain the corresponding 
X 2 difference between the PDFs for the cutset and the base-case; 
15 (Step 18) Compare linear statistics from (Step 3) to that from all of the cutsets (step 
1 1) as a further data quality check; 
(Step 19) Compute the bivariate mutual information function for the entire dataset 

of all combinations of multichannel data; 
(Step 20) Save the analysis results from the previous steps; 
20 (Step 21) Use specific quantitative criteria, (e.g., level crossings, warning messages) 
to display diagnostic and prognostic output. 
As seen specifically in reference to Figure 6, an alternative embodiment to the 
present invention is given in steps A through U, as follows: 
Step A - Provide nonlinear data 
25 Step B - Initialize analysis 

Step C - Input time serial data 
Step D - Apply artifact filter of data 
Step E - Calculate cycle energy from three channels of data 
Step F - Apply quality check of whole dataset 
30 Step G - Base-case data cutset 

Step H - Extract data cutset 
Step I - Apply low pass filter to data 
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Step J - Apply quality check to cutset 

Step K - Fourier spectrum 

Step L - Correlation dimension spectrum 

Step M - K-entropy spectrum 

Step N - Univariate MIF 

Step 0 - Return map 

Step P - Principal components analysis 

Step Q - Phase-space PDF 

Step R - Bivariate MIF 

Step S - Output results 

Step T - Determine whether differences are indicated 

Step U - Loop over cutsets (representing a loop between Step H and Step Q) 
Many of these steps are standard applications of computational methods, as one 
skilled in the art can appreciate. However, the novelty rests in the unique interaction 
among those computational components. Previously, each step was a distinct, research- 
class FORTRAN implementation. In the present embodiment, these steps are integrated 
into a single automated methodology, which uses results from the previous step for 
subsequent processing, as outlined above. The various steps use results from the 
preceding analysis as follows: 



This step 

Fourier power spectrum 
mutual information functions 



phase-space PDF 



Requires these results from previous analyses 
x and a 2 from linear statistics 
a, x„, and x x from linear statistics 

Mj from the univariate mutual information 
function 



X 2 comparison of PD-PDFs 



PS-PDFs for base case and cutset 



correlation dimension spectrum 



a and T c from linear statistics 

f { from the Fourier power spectral density 



Kolmogorov entropy spectrum 



a and T c from linear statistics 

f, from the Fourier power spectral density 



Principal components analysis 



x from linear statistics 

M, from the univariate mutual information 
function 
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Data quality check x, a, a, x,,, x,, s, k, and T c from linear statistics 

f, and f 2 from the Fourier power spectral density 

The data quality check consists of the following calculations: 
(step i) Determine the number of bits of active precision in the data by the 

maximum entropy method. Namely, the range of variation in the data (x„ 
< Xj < xj is divided into M=2 m equally spaced bins. Each data value (x^) 
is assigned to one such bin according to the formula, J = INT[M(Xj - xJ/( 
x x - xj + 1], where INT(...) coverts a real number to the next smaller 
integer [e.g., INT(3.14) = 3]. A one-dimensional probability density (D) 
function is constructed by incrementing the population of the J-th element, 
for each data value (x^. The corresponding normalized entropy is: 



e(m) = -(InM)- 1 E d In d (Eq. 7) 

where dj is the normalized probability density, dj = D/M. The entropy, 
e(m) is found for successively larger integer values of m < 16, from which 
the maximum in e(m) is obtained, corresponding to the number of active 
bits (B) of data precision. If B < 5, issue a warning. 

(step ii) Determine the apparent data precision via the formula, b = log 2 [(x 3l - 
xJ/AxJ, where Ax, is the magnitude of the smallest difference between 
any two values of x x and is 1 for integer data; 

(step iii) Compute the first difference, AXf = x^, - x { for the entire dataset, and 
obtain the minimum (AxJ and maximum (AxJ value of AX;. 
If Ax n > 0 over the whole dataset, issue a warning that the signal is only 
increasing; 

If Ax, < 0 over the whole dataset, issue a warning that the signal is only 
decreasing; 

If |AxJ > (x, - xJ/2, issue a warning that excessively large negative shifts 
occur; 

If |AxJ > (x, - xJ/2, issue a warning that excessively large positive shifts 
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occur; 

Then, discretize the AXj data via the formula, J = INT[M(Axj - AxJ/fAx* - 
AxJ + 1], and compute a one-dimension PDF as in calculation i), where 
M is smaller of the following two quantities: IFIXftAXj - AxJ/ Ax,+1] and 
2 B . Determine the bin with the largest population (N x ). Check the 
population of other bins in the fringes of the distribution function, that is 
when J < 0.1M or J > 0.9M. If the signal is not strongly periodic (i.e., the 
power in the Fourier spectral peaks corresponding to f, and f 2 is less than 
half of the total spectral power), then if the population in any one fringe 
bin is excessive (^N^IO), issue a warning. 

Also, determine the PDF bin index (J 0 ) that corresponds to A\ = 0, and 
check the bin populations for J < J 0 - M/5 and J > J 0 + M/5. If the 
signal is not strongly periodic (i.e., the power in the Fourier spectral peaks 
corresponding to f x and f 2 is less than half of the total spectral power), 
then if the population in any one outer bin is excessive (^Nj/10), issue a 
warning. 

(step iv) Use the one-dimensional PDF (P,) constructed in step i, corresponding to 
the active number of bits of data precision (B) from step i, as the base 
case. Use the same method as in step i to construct a one-dimensional PDF 
for each cutset (Q,). The occurrence frequency in each bin of P, is scaled 
so that the total population (sum of all ? } bins) is the same as the total 
population of Q,. Namely, each bin of P, is scaled cP,,by the factor c = 
(total population of Q } ) I (total population of P,). Then P t and Q, are 
compared by a x 2 statistic: 

x 2 = S (Q,(i)-P.(0) 2 / P,0>KQ„-Pu) 2 / P., (Eq- «) 

The first term in (Eq. 8) is used if P,(i) > 10. If P,(i) < 10, then the sparse bins 
are pooled into one (or more) accumulation bins(s) and compared by the second term in 
(Eq. 8). As before, the resulting X\ 7 measure is scaled to a normal distribution by X, = 
(X 2 - u) / (2o) l/2 , with u = number of non-empty P, bins in (Eq. 8) minus one. Issue a 
warning if X, > 100. 
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(step v) Warn of an inadequate sampling rate if T c (for whole dataset) < 5 
timesteps/cycle; 

(step vi) Issue a warning if the total number of data points is <1 0,000. 

As is obvious to one skilled in the art, the x 2 comparison can also be applied to 

5 multi-dimensional PDFs that are constructed from other nonlinear measures. One such 
application is principal components analysis, which reconstructs the nonlinear dynamics 
from experimental data. Abarbanel et al. (7) and Grassberger et al. (8) review this and 
related techniques. A sequence of data (Xj) is measured at equal intervals (x), beginning 
at some initial time (to), x { = x(to + it). Conversion of this data to a system of differential 

10 equations for the dynamics requires that the derivatives be approximated by finite 
differences of the data values [dx/dt = (x^, - xj/x, dVdt 2 = (x* 2 - 2x*, - Xj)/x 2 , etc.]. 
These and higher derivatives use the information that already resides in the ?q data, as 
multiples of other measurements at times that are multiples of the observation interval 
(x). A more direct method (9) uses lagged variables, x„ +ix = x(to + (n+iX)x), with X as an 

15 integer lag that is determined from the data. In particular, this method creates a vector 
of data in d dimensions: 

y( n ) = [Xn* Vx> Xn+K* — » X^d-l)*] 

Takens (10) found that faithful reconstruction of the system dynamics requires the 
embedding dimension (d) to be at least one plus twice the number of degrees of freedom 

20 (DOF). We measure the DOF for a given scale length as the correlation dimension (D), 
yielding an embedding dimension of d > 2D + 1. If the lag value is too small, then 
adjacent coordinates, x^ and x^,^ will be so close together in numerical value that 
they cannot be distinguished. If the lag value is too large, then these same adjacent 
coordinates are essentially independent due to the inherent unpredictability of nonlinear 

25 or chaotic systems after a long time. An optimal time length (2) should span enough data 
to go from the first maximum in the univariate MIF (maximum nonlinear correlation that 
always occurs at zero lag) to the first minimum in the MIF (minimum nonlinear 
correlation at M, as measured in timesteps). The vector in the last equation has a total 
of (d-1) X timesteps, so the lag value is obtained from M, = (d-1) X > (2D + 1 - 1) X = 

30 2DX, or 1 £ X £ M,/2D. When these constraints are satisfied, Takens' embedding theorem 
(10) guarantees that the resulting representation faithfully reconstructs the system 
dynamics. Broomhead and King (11) obtained the eigenvalues (Cj) of the covariance 
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matrix of y(n), which are all positive and can be ordered monotonically from largest to 
smallest. The larger eigenvalues correspond to the true system dynamics in dr dimensions 
and typically constitute > 99% of the sum over all the eigenvalues. Other eigenvalues 
arise from noise in d N dimensions with d = d T + d N , corresponding to a noise floor in a t 
versus i. The eigenvectors are orthogonal and are used to project out the independent 
dynamic components from the data, e.g., z^i) as the k-th dynamic component at the i-th 
timestep. The "principal components" match the larger eigenvalues. The system dynamics 
are converted to two- or three-dimensions by connecting the sequence of points (z,(i), 
ZjO)) for the first two principal components, or by connecting the triplet (z,(i), z^i), Zj(i)) 
for the first three principal components. Thus, as is apparent to one skilled in the art, the 
PDF that is constructed from principal components analysis can be used in the same way 
as the phase-space PDF for % 2 comparisons as a condition change monitor. 

Return map analysis is another form of nonlinear analysis to which % 2 
comparisons can also be applied for multi-dimensional PDFs. For example, we can 
analyze the cycle-to-cycle energy from the positive- and negative-starting power (p), 
respectively, yielding E* or E{ as a measure that characterizes the i-th cycle, or event. 
Evaluation of the time-serial power produces two, discrete sequences, {E,"\ E/, ... , E B *} 
and {E,-, E 2 \ ... , E 0 '}. These sequences move from one unstable periodicity to another 
in recurring patterns that are driven by the nonlinearity in the component dynamics. We 
extract these patterns in a "return map" by analyzing the energies in pairs (E^, versus 
Ej). Points near the diagonal, E Hm = E s , correspond to period-m motion because this 
location is visited every m-th cycle. Such points are also "unstable" because the system 
does not remain there, and "fixed" because the period-m location(s) do not change during 
repeated visits. Chaotic systems have an infinite number of unstable, periodic, fixed 
points. Moreover, stable and unstable directions span the region near an unstable fixed 
point in the shape of a multi-dimensional saddle. Return-map points approach an unstable 
fixed point along a locally-linear "stable" direction (or manifold), like a ball rolling stably 
toward the center of a saddle. Return-map points depart from an unstable fixed point 
along a locally linear "unstable" direction (or manifold), like a ball rolling rapidly off the 
side of a saddle. The distance between the fixed point and successive departing points 
increases exponentially, displaying sensitivity to initial conditions which is the hallmark 
of chaos. Characterization of this topology requires analysis of the data for multiple 
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approaches to and departures from the same fixed point along the same directions. As is 
obvious to one skilled in the art, the construction of such a return map in three- 
dimensions involves points (E t , , E Mm )> and (E i9 E^, E^^ in d dimensions. 
A sample of such d-dimension points constructs a probability density function using the 
same approach as above for the phase-space PDF and the principal components PDF. The 
physical meaning of the PDF is the same in all three instances: an n-dimensional 
topological representation of the nonlinear dynamics, which can subsequently be used for 
condition monitoring via the % 2 method. 

Possible alternative applications for this invention include: software algorithm on 
appropriate computers and hardware implementation of the same methodology, such as 
algorithm on a custom chip or on a digital signal processor. 

While there has been shown and described what are at present considered the 
preferred embodiments of the invention, it will be obvious to those skilled in the art that 
various changes and modifications can be made therein without departing from the scope 
of the inventions defined by the appended claims. 
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We claim: 

(1) A method for automatically discriminating between similar but different states in 
a nonlinear process comprising the steps of: 

(A) Operating a data provision means selected from the group consisting of 
data storage means and data acquisition means to provide at least one 
channel of nonlinear data, called e-data; 

(B) Separating the e-data into artifact data, called f-data, and artifact-free data, 
called g-data, while preventing phase distortions in the data; 

(C) Processing g-data through a filter to produce a filtered version of g-data, 
called h-data; 

(D) Applying at least one nonlinear measure to at least one type of data 
selected from the group consisting of e-data, f-data, g-data, and h-data to 
provide at least one time serial sequence of nonlinear measures from 
which at least one indicative trend selected from the group consisting of 
abrupt increases and abrupt decreases can be determined; 

(E) Comparing at least one indicative trend with at least one known 
discriminating indicator; 

(F) Determining from said comparison whether differences between similar 
but different states are indicated; and 

(G) Providing notification whether differences between similar but different 
states are indicated, 

Said steps B, C, D, E, and F being accomplishable in one integrated sequence of 
computer analyses. 

(2) The method as described in Claim 1 wherein said at least one time serial sequence 
of nonlinear measures is selected from the group consisting of: time per wave cycle for 
e-data, time per wave cycle for f-data, time per wave cycl.e for g-data, time per wave 
cycle for h-data, Kolmogorov entropy for e-data, Kolmogorov entropy for f-data, 
Kolmogorov entropy for g-data, Kolmogorov entropy for h-data, first minimum in the 
mutual information function for e-data, first minimum in the mutual information function 
for f-data, first minimum in the mutual information function for g-data, first minimum 
in the mutual information function for h-data, correlation dimension for e-data, 
correlation dimension for f-data, correlation dimension for g-data, correlation dimension 
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for h-data, and combinations thereof. 

(3) The method as described in Claim 1 wherein the e-data is separated into f-data 
and g-data by use of a zero-phase filter. 

(4) The method as described in Claim 1 wherein the filter comprises a standard low- 
pass filter selected from the group consisting of second-order, third-order, and fourth- 
order low-pass filters at frequencies between about 35 Hz and about 60 Hz. 

(5) The method as described in Claim 4 wherein the low-pass filter is a standard 
fourth-order low-pass filter at about 50 Hz. 

(6) Apparatus for automatically discriminating between similar but different states in 
a nonlinear process comprising: 

(A) Data provision means for providing at least one channel of nonlinear data, 
called e-data, said data provision means being selected from the group 
consisting of data storage means and data acquisition means; 

(B) Separation means for separating the e-data into artifact data, called f-data, 
and artifact-free data, called g-data, while preventing phase distortions in 
the data, communicably connected to said data provision means; 

(C) Filter means for filtering g-data to produce a filtered version of g-data, 
called h-data, communicably connected to said separation means; 

(D) Application means for applying at least one nonlinear measure to at least 
one type of data selected from the group consisting of e-data, f-data, g- 
data, and h-data to provide at least one time serial sequence of nonlinear 
measures, from which at least one indicative trend selected from the group 
consisting of abrupt increases and abrupt decreases can be determined, 
communicably connected to said filter means; 

(E) Comparison means for comparing at least one indicative trend with at least 
one known discriminating indicator, communicably connected to said 
application means; 

(F) Determination means for determining from said comparison whether 

differences between similar but different states are indicated, 
communicably connected to said comparison means; and 

(G) Notification means for providing notification whether differences between 
similar but different states are indicated, communicably connected to said 
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determination means, 
Said elements B, C, D, E, and F being operable in one integrated sequence of 
computer analyses. 

(7) The apparatus as described in Claim 6 wherein said at least one time serial 
sequence of nonlinear measures is selected from the group consisting of: time per wave 
cycle for e-data, time per wave cycle for f-data, time per wave cycle for g-data, time per 
wave cycle for h-data, Kolmogorov entropy for e-data, Kolmogorov entropy for f-data, 
Kolmogorov entropy for g-data, Kolmogorov entropy for h-data, first minimum in the 
mutual information function for e-data, first minimum in the mutual information function 
for f-data, first minimum in the mutual information function for g-data, first minimum 
in the mutual information function for h-data, correlation dimension for e-data, 
correlation dimension for f-data, correlation dimension for g-data, correlation dimension 
for h-data, and combinations thereof. 

(8) The apparatus as described in Claim 6 wherein the e-data is separated into f-data 
and g-data by use of a zero-phase filter. 

(9) The apparatus as described in Claim 8 wherein said filter means comprises a 
standard low-pass filter selected from the group consisting of second-order, third- 
order, and fourth-order low-pass filters at frequencies between about 35 Hz and 
about 60 Hz. 

(10) The apparatus as described in Claim 9 wherein said low-pass filter is a standard 
fourth-order low-pass filter at about 50 Hz. 

(1 1) A method for automatically discriminating between similar but different states in 
a nonlinear process comprising the steps of: 

(A) Operating a data provision means selected from the group consisting of 
data storage means and data acquisition means to provide at least one 
channel of nonlinear data, called e-data; 

(B) Separating the e-data into artifact data, called f-data, and artifact-free data, 
called g-data, while preventing phase distortions in the data; 

(C) Processing g-data through a filter to produce a filtered version of g-data, 
called h-data; 

(D) Applying the lag derived from the first minimum in the mutual 
information function to create a d-dimensional probability density function 
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which forms a high-dimensional topology for at least one type of data 
selected from the group consisting of e-data, f-data, g-data, and h-data 
whereby at least one indicative trend can be determined; 

(E) Comparing at least one indicative trend with at least one known 
5 discriminating indicator; 

(F) Determining from said comparison whether differences between similar 

but different states are indicated; and 

(G) Providing notification whether differences between similar but different 
states are indicated, 

10 Steps B, C, D, E, and F being accomplishable in one integrated sequence of 

computer analyses. 

(12) The method as described in Claim 11 wherein the e-data is separated into f-data 
and g-data by use of a zero-phase filter. 

(13) The method as described in Claim 12 wherein the filter comprises a standard low- 
15 pass filter selected from the group consisting of second-order, third-order, and 

fourth-order low-pass filters at frequencies between about 35 Hz and about 60 Hz. 

(14) The method as described in Claim 13 wherein the low-pass filter is a standard 
fourth-order low-pass filter at about 50 Hz. 

(15) A method for automatically discriminating between similar but different states in 
20 a nonlinear process comprising the steps of: 

(A) Operating a data provision means selected from the group consisting of 
data storage means and data acquisition means to provide at least one 
channel of nonlinear data, called e-data; 

(B) Separating the e-data into artifact data, called f-data, and artifact-free data, 
25 called g-data, while preventing phase distortions in the data; 

(C) Processing g-data through a filter to produce a filtered version of g-data, 
called h-data; 

(D) Applying the lag derived from the first minimum in the mutual 
information function to create a d-dimensional probability density function 

30 which forms a high-dimensional topology for at least one type of data 

selected from the group consisting of e-data, f-data, g-data, and h-data 
whereby at least one indicative trend can be determined; 
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(E) Generating a key from the set of indices of each bin in the 
multidimensional probability density function; 

(F) Representing occupied bins from the same class by a linked list to provide 

a reduction in the number of elements to be stored in the manner 
known as hashing whereby at least one hashed discriminating trend 
can be determined; 

(G) Comparing at least one indicative trend with at least one known 

discriminating indicator; 

(H) Determining from said comparison whether differences between similar 
but different states are indicated; and 

(I) Providing notification whether differences between similar but different 
states are indicated, 

Said discriminating indicator being hashed, and Steps B, C, D, E, F, G, and H 
being accomplishable in one integrated sequence of computer analyses. 
(16) An apparatus for automatically discriminating between similar but different states 
in a nonlinear process comprising: 

(A) Data provision means for providing at least one channel of nonlinear data, 
called e-data, said data provision means being selected from the group 
consisting of data storage means and data acquisition means; 

(B) Separation means for separating the e-data into artifact data, called f-data, 
and artifact-free data, called g-data, while preventing phase distortions in 
the data, communicably connected to said data provision means; 

(C) Filter means for filtering g-data to produce a filtered version of g-data, 
called h-data, communicably connected to said separation means; 

(D) Application means for applying the lag derived from the first minimum in 
the mutual information function to create a d-dimensional probability 
density function which forms a high-dimensional topology for at least one 
type of data selected from the group consisting of e-data, f-data, g-data, 
and h-data whereby at least one indicative trend can be determined, 
communicably connected to said filter means; 

(E) Comparison means for comparing at least one indicative trend with at least 
one known discriminating indicator, communicably connected to said 
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application means; 

(F) Determination means for determining from the comparison whether 
differences between similar but different states are indicated, 
communicably connected to said comparison, means; and 
5 (G) Notification means for providing notification whether differences between 

similar but different states .are indicated, communicably connected to said 
determination means, 

Said discriminating indicator being hashed and said elements B, C, D, E, and F 

being operable in one integrated sequence of computer analyses. 
10 (17) The apparatus as described in Claim 16 wherein the e-data is separated into f-data 

and g-data by use of a zero-phase filter. 

(18) The apparatus as described in Claim 17 wherein the filter means comprises a 
standard low-pass filter selected from the group consisting of second-order, third- 
order, and fourth-order low-pass filters at frequencies between about 35 Hz and 

15 about 60 Hz. 

(19) The apparatus as described in Claim 18 wherein the low-pass filter is a standard 
fourth-order low-pass filter at about 50 Hz. 

(20) An apparatus for automatically discriminating between similar but different states 
in a nonlinear process comprising: 

20 (A) Data provision means for providing at least one channel of nonlinear data, 

called e-data, said data provision means being selected from the group 
consisting of data storage means and data acquisition means; 

(B) Separation means for separating the e-data into artifact data, called f-data, 
and artifact-free data, called g-data, while preventing phase distortions in 

25 the data, communicably connected to said data provision means; 

(C) Filter means for filtering g-data to produce a filtered version of g-data, 
called h-data, communicably connected to said separation means; . 

(D) Application means for applying the lag derived from the first minimum in 
the mutual information function to create a d-dimensional probability 

30 density function which forms a high-dimensional topology for at least one 

type of data selected from the group consisting of e-data, f-data, g-data, 
and h-data whereby at least one indicative trend can be determined, 
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communicably connected to said filter means; 

(E) Generation means for generating a key from the set of indices of each bin 
in the multidimensional probability density function, communicably 
connected to said application means; 

(F) Representation means for representing occupied bins from the same class 
by a linked list to provide a reduction in the number of elements to be 
stored in the manner known as hashing, whereby at least one hashed 
indicative trend can be determined, communicably connected to said 
generation means; 

(G) Comparison means for comparing at least one indicative trend with at least 
one known discriminating indicator, communicably connected to said 
representation means; 

(H) Determination means for determining from said comparison whether 
differences between similar but different states are indicated, 
communicably connected to said comparison means; and 

(I) Notification means for providing notification whether differences between 
similar but different states are indicated, communicably connected to said 
determination means, 

Said discriminating indicator being hashed, and Elements B, C, D, E, F, G, and 
H being operable in one integrated sequence of computer analyses. 
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